ADAPTIVE OPTIMAL SIGNAL (CARDIOGRAM) PROCESSING,
with precise measurement of boundary values and energy.



E. Ostrovsky, L. Sirota, Israel.

"O.L.E." Company; Rehovot, 76842, Israel;
e-mails: galo@list.ru; eugostrovsky@list.ru;

Department of Mathematics and Statistics, Bar-Ilan University, 59200, Ramat Gan.

e-mail: sirota@zahav.net.il

Abstract.

We construct an adaptive procedure for denoising a signal observed against background noise, asymptotically optimal in order in the weighted Hilbert space norms, together with a measurement of the signal energy, with high precision near the boundary of the signal.

The proposed method uses the Fourier-Riesz expansion over orthonormal polynomials, for instance Jacobi polynomials, relative to a weight function that is unbounded near the boundary.
The applications are technical and medical, in particular cardiac diagnosis.

1 Introduction. Statement of problem

A mathematical model: the cardiogram $f = f(x)$, $x \in [-1, 1]$, is an unknown Riemann integrable function observed at the points $\{x(i) = x(i, n),\ i = 1, 2, \ldots, n\}$ against a background of additive white noise $\varepsilon(i)$; in other terms, we consider the REGRESSION PROBLEM (R):

$$ \xi(i) = f(x(i)) + \sigma \varepsilon(i), \tag{0} $$

where $\{\varepsilon(i)\}$ is an additive noise; the $\varepsilon(i)$ are independent, centered ($\mathbf{E}\varepsilon(i) = 0$) and normed ($\mathrm{Var}[\varepsilon(i)] = 1$) random variables (r.v.) with unknown distribution; in particular, the r.v. $\varepsilon(i)$ need not be Gaussian distributed; so that

$$ \sigma^2 = \mathrm{Var}[\xi(i)] = \mathrm{const} \in (0, \infty), $$

$$ x(i) = x(i, n) = (2i - n - 1)/(n - 1), \quad (n > 15), $$

and $f = f(x)$ is the unknown Riemann integrable function (in other terms, the signal, for instance, a cardiogram).
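
The observation scheme of problem (R) can be simulated as in the following minimal sketch. It assumes the model xi(i) = f(x(i)) + sigma * eps(i); the names `design_points`, `simulate_observations` and `toy_signal` are hypothetical, and the uniform noise is chosen only to illustrate that the errors need not be Gaussian.

```python
import math
import random

def design_points(n):
    """Equally spaced design x(i, n) = (2i - n - 1)/(n - 1), i = 1..n, on [-1, 1]."""
    return [(2 * i - n - 1) / (n - 1) for i in range(1, n + 1)]

def simulate_observations(f, n, sigma, rng):
    """Return (x, xi) with xi(i) = f(x(i)) + sigma * eps(i)."""
    x = design_points(n)
    # Assumption: uniform noise on [-sqrt(3), sqrt(3)], which is centered
    # with unit variance but not Gaussian.
    half_width = math.sqrt(3.0)
    xi = [f(t) + sigma * rng.uniform(-half_width, half_width) for t in x]
    return x, xi

def toy_signal(t):
    # Hypothetical stand-in for a cardiogram-like signal.
    return math.exp(-20.0 * t * t) + 0.3 * math.sin(3.0 * t)

x, xi = simulate_observations(toy_signal, n=101, sigma=0.1, rng=random.Random(0))
```

The first and last design points coincide with the boundaries x = -1 and x = +1, which is where the weighted norms below are meant to give extra precision.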

Our aim is to offer a so-called adaptive estimation of the function (signal) $f = f(x)$ and of its energy of arbitrary order that is optimal in order as $n \to \infty$ in the modular Hilbert space sense, with great precision especially near the boundaries $x = \pm 1$.



In the ordinary Hilbert spaces this problem has been considered in many publications; see, for instance, [1], [2], [3], [4], [7], [8], [11], [15] and the references therein.



2 Notations and definitions 

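1. Modular Hilbert spaces. Let $X = [-1, 1]$, $x \in X$, and let $\Phi = \{\phi_k(x)\}$, $k = 1, 2, \ldots$ be some fixed complete orthonormal polynomial system which is bounded:

$$ \sup_k \sup_{x \in [-1, 1]} |\phi_k(x)| < \infty, $$

and uniformly Lipschitz continuous:

$$ |\phi_k(x) - \phi_k(y)| \le C \cdot k \cdot |x - y|. $$

Orthonormality is understood relative to some non-trivial non-negative integrable signal weight function $\gamma = \gamma(x)$:

$$ (\phi_k, \phi_l)_\gamma := \int_{-1}^{1} \phi_k(x) \phi_l(x) \gamma(x)\, dx = \delta(k, l), $$

where $\delta(k, l)$ is the Kronecker symbol, and

$$ \deg \phi_k = k - 1, \quad k = 1, 2, \ldots. $$

We impose henceforth the following conditions on the weight function $\gamma(x)$: $\gamma(x) > 0$,

$$ K(\gamma) = \pi^{-1} \int_{-1}^{1} \frac{\gamma(x)}{\sqrt{1 - x^2}}\, dx = (2\pi)^{-1} \int_{-\pi}^{\pi} \gamma(\cos\theta)\, d\theta < \infty; \tag{1.a} $$

$$ \int_{-1}^{1} \frac{\log \gamma(x)}{\sqrt{1 - x^2}}\, dx > -\infty; \tag{1.b} $$

$$ \exists \kappa = \mathrm{const} < 1/2,\ \forall x \in (-1, 1): \quad |\gamma'(x)| \le C \left[ (1 + x)^{-\kappa - 1} + (1 - x)^{-\kappa - 1} \right]. \tag{1.c} $$

We redefine the weight function $\gamma = \gamma(x)$ at the boundary points $x = \pm 1$ as follows:

$$ \gamma(-1) = \gamma(+1) = 0. $$

The conditions (1.a), (1.b) and (1.c) are satisfied, e.g., for the weight function

$$ \gamma_{\alpha,\beta}(x) = (1 - x)^{\alpha} (1 + x)^{\beta}, \quad \alpha, \beta > -1/2, \quad x \in (-1, 1); \quad \gamma_{\alpha,\beta}(\pm 1) = 0, $$

of the classical (normed!) Jacobi polynomials, which we will denote

$$ \phi_k^{(\alpha,\beta)}(x): \quad \deg(\phi_k^{(\alpha,\beta)}(x)) = k - 1, $$

$$ \int_{-1}^{1} \phi_k^{(\alpha,\beta)}(x) \phi_l^{(\alpha,\beta)}(x) \gamma_{\alpha,\beta}(x)\, dx = \delta(k, l). $$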

Since we need to estimate with great precision the behavior of the unknown function $f(x)$ near the points $x = \pm 1$, we will suppose further that

$$ \alpha, \beta \in (-1/2, 0]. $$

Note that

$$ K(\gamma_{\alpha,\beta}) = \pi^{-1}\, 2^{\alpha+\beta}\, B(\alpha + 1/2, \beta + 1/2), \tag{2} $$

obviously when $\alpha, \beta > -1/2$; $B(\alpha, \beta)$ denotes the ordinary Beta function.

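Much of the information about orthogonal polynomials that we use may be found in [5], [13], [14]. For instance, these works describe recurrence relations of the form

$$ \phi_{k+2}(x) = (A(k; \Phi) x + B(k; \Phi)) \phi_{k+1}(x) + C(k; \Phi) \phi_k(x), \quad k = 0, 1, 2, \ldots, $$

with initial conditions (here the polynomials are indexed by their degree)

$$ \phi_0(x) = \left[ \int_{-1}^{1} \gamma(x)\, dx \right]^{-1/2} =: C_0 = C_0(\gamma), \qquad \phi_1(x) = A (x - C_1), $$

where

$$ C_1 = C_1(\gamma) = \frac{\int_{-1}^{1} x \gamma(x)\, dx}{\int_{-1}^{1} \gamma(x)\, dx}, \qquad A^{-2} = A^{-2}(\gamma) = \int_{-1}^{1} (x - C_1)^2 \gamma(x)\, dx. $$

For instance,

$$ \int_{-1}^{1} \gamma_{\alpha,\beta}(x)\, dx = 2^{\alpha+\beta+1} B(\alpha + 1, \beta + 1), $$

$$ C_1(\gamma_{\alpha,\beta}) = \frac{\beta - \alpha}{\alpha + \beta + 2}, $$

$$ A^{-2}(\gamma_{\alpha,\beta}) = 2^{\alpha+\beta+1} B(\alpha + 1, \beta + 1) \times \frac{4 (\alpha + 1)(\beta + 1)}{(\alpha + \beta + 3)(\alpha + \beta + 2)^2}. $$

These recurrence relations with the corresponding initial conditions may be used for programming the proposed algorithms.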
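
As an illustration of how such a recurrence can be programmed, the following sketch treats only the simplest admissible case alpha = beta = 0 (weight gamma = 1), where the orthonormal system consists of the normalised Legendre polynomials. The function names and the midpoint-rule check are assumptions made for this example; a general Jacobi weight would need its own recurrence coefficients.

```python
import math

def orthonormal_legendre(k_max, x):
    """Values phi_1(x), ..., phi_{k_max}(x) of the orthonormal Legendre system
    (weight gamma = 1 on [-1, 1]); phi_k has degree k - 1."""
    p_prev, p_curr = 0.0, 1.0          # placeholder for P_{-1}, and P_0
    values = []
    for j in range(k_max):             # j is the polynomial degree
        values.append(math.sqrt((2 * j + 1) / 2.0) * p_curr)
        # Bonnet recurrence: (j + 1) P_{j+1} = (2j + 1) x P_j - j P_{j-1}
        p_next = ((2 * j + 1) * x * p_curr - j * p_prev) / (j + 1)
        p_prev, p_curr = p_curr, p_next
    return values

def gram_entry(k, l, grid=20000):
    """Midpoint-rule approximation of the inner product (phi_k, phi_l) for gamma = 1."""
    h = 2.0 / grid
    total = 0.0
    for i in range(grid):
        x = -1.0 + (i + 0.5) * h
        phi = orthonormal_legendre(max(k, l), x)
        total += phi[k - 1] * phi[l - 1] * h
    return total
```

Calling `gram_entry(k, l)` for a few index pairs gives a quick numerical check that the computed system is orthonormal.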

Remark 1. In our opinion, for practical use it is convenient to choose the interval (0, 1) instead of (-1, 1) and the weight function

$$ \gamma(x) = \frac{x^{\alpha - 1} (1 - x)^{\beta - 1}}{B(\alpha, \beta)}, \quad x \in (0, 1); $$

$\gamma(0) = \gamma(1) = 0$; $\alpha, \beta > 0$; i.e. $\gamma(x)$ is the density of the well-known Beta distribution.
Then $\phi_0(x) = 1$,

$$ \phi_1(x) = \frac{(\alpha + \beta) x - \alpha}{\sqrt{\alpha\beta/(\alpha + \beta + 1)}}. $$

The useful values of the parameters $\alpha, \beta$ are the following:

$$ \alpha, \beta \in (1/2, 1]. $$



For an arbitrary integrable function $f: [-1, 1] \to R$ we put

$$ f(x) = \sum_{k=1}^{\infty} c(k) \phi_k(x); $$

here

$$ c(k) = \int_{-1}^{1} f(x) \phi_k(x) \gamma(x)\, dx $$

are the Fourier-Riesz coefficients of the function $f(\cdot)$ over the system $\Phi$.

Let $w = w(k)$, $k = 1, 2, \ldots$ be some positive function (sequence), for example, $w(k) = k^m$, $m > 0$. The value of $m$ is said to be the order of energy; it need not be an integer.

We will call the function $w = w(k)$ the energy weight, in contrast to the signal weight $\gamma = \gamma(x)$.

We introduce the following Hilbert modular space $B = B(\Phi, w)$, which will be called the modular Hilbert weight space, consisting of all (measurable) functions $\{f\}$ with finite quadratic functional

$$ W(w)[f] = W(\Phi, w)[f] := \sum_{k=1}^{\infty} w(k) |c(k)|^2. \tag{3} $$

The Hilbert norm in this space may be defined as usual:

$$ \|f\|W(w) = \sqrt{W(w)[f]}. $$
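
For a finite list of coefficients the functional (3) and the corresponding norm reduce to a weighted sum of squares. A minimal sketch, assuming the power weight w(k) = k^m and hypothetical function names:

```python
def w_energy(coeffs, m):
    """W(w)[f] = sum_k w(k) * c(k)^2 with w(k) = k^m; coeffs[k - 1] stores c(k)."""
    return sum((k ** m) * c * c for k, c in enumerate(coeffs, start=1))

def w_norm(coeffs, m):
    """Hilbert norm induced by the quadratic functional W(w)."""
    return w_energy(coeffs, m) ** 0.5
```

With m = 0 this is the ordinary energy; larger m penalises high-order coefficients more strongly.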

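For instance, $W(w)[f]$ may be the integral of the square of the derivative of an arbitrary order $r$ of the signal $f(\cdot)$:

$$ W(w)[f] = \int_{-1}^{1} \left[ f^{(r)}(t) \right]^2 dt $$

or its weighted modification:

$$ W(w)[f] = \int_{-1}^{1} \lambda(t) \left[ f^{(r)}(t) \right]^2 dt. $$

In the case when $w(k) = w_\theta(k) \asymp k^{\theta}$, $\theta = \mathrm{const} > 0$, we will denote $W(\Phi, w_\theta) = W_\theta(\Phi) = W_\theta$.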

Evidently, if $w(k) = w_0(k) = 1$, then

$$ \sum_{k=1}^{\infty} c^2(k) = \|f\|^2 W(w_0) = (f, f)_\gamma. $$

For instance, the Sobolev space $W_2^m = W_2^m[-1, 1]$, $m = 0, 1, 2, \ldots$, consisting of all periodic (mod 2) $m$ times differentiable functions $f$ with finite norm

$$ \left[ \left( \|f\|L_2[-1, 1] \right)^2 + \left( \|f^{(m)}\|L_2[-1, 1] \right)^2 \right]^{1/2}, $$

is a weighted Hilbert modular space relative to the trigonometric system $T$.
The functional

$$ W(f) = W(f; \Phi, w) = \|f\|^2 W(\Phi, w) $$

will be called the Energy, or more exactly the $w$-Energy, of the signal $f$. We denote any estimation of $W(f)$ by $W(n, f)$.

2. Loss functions. We define the following loss function $L = L(g(n, \cdot), f(\cdot))$ for an arbitrary estimation of the function $f$ (more exactly, for a sequence of estimations) $g(n, x) = g(n, x; \{\xi(i)\})$ as

$$ L(g(n, \cdot), f(\cdot)) = L_{\Phi,\gamma}(g(n, \cdot), f(\cdot)) := \mathbf{E} \|g(n, \cdot) - f(\cdot)\|_\gamma^2. \tag{4.a} $$

For an arbitrary energy estimation $W(n, f)$ we denote by $Z(W(n, f), W(f))$ the following loss function (functional):

$$ Z(W(n, f), W(f)) := \mathbf{E} (W(n, f) - W(f))^2. \tag{4.b} $$

3. Notation. We denote:

$$ S(N; m) = \sum_{k=1}^{N} w^m(k), \qquad S(N) = S(N, 1) = \sum_{k=1}^{N} w(k); $$

$$ \rho_w(N; f) = \rho_w(N) = \sum_{k=N+1}^{\infty} w(k) |c(k)|^2, \qquad \rho(N) = \sum_{k=N+1}^{\infty} |c(k)|^2; \tag{5} $$

then

$$ f \in B(\Phi, w) \iff \lim_{N \to \infty} \rho_w(N; f) = 0. $$



3 Adaptive signal estimation: denoising 

1. Proposed estimations. Let us denote the truncated Fourier-Riesz sum for the function $f(x)$ as follows:

$$ f(N)(x) = \sum_{k=1}^{N} c(k) \phi_k(x), $$

and correspondingly

$$ f(N, n)(x) = \sum_{k=1}^{N} c_n(k) \phi_k(x), $$

where

$$ c_n(k) = n^{-1} \sum_{i=1}^{n} f(x(i, n)) \cdot \phi_k(x(i, n)) \cdot \gamma(x(i, n)). $$

The non-random variables $c_n(k)$ are consistent approximations of the Fourier (or Fourier-Riesz) coefficients $c(k)$; they are Riemann integral sums for the function

$$ f(x) \phi_k(x) \gamma(x). $$

Let us consider the so-called projection, or Tchentsov, estimation $f(N, n, x)$ of the function $f(\cdot)$ of the form

$$ f(N, n, x) = \sum_{k=1}^{N} c(k, n) \phi_k(x), \tag{6a} $$

where the random variables $c(k, n)$ are consistent estimations of the Fourier coefficients $c(k)$ based on the data $\xi(i)$, $i = 1, 2, \ldots, n$; namely, for the considered regression problem R,

$$ c(k, n) = n^{-1} \sum_{i=1}^{n} \xi(i) \cdot \phi_k(x(i, n)) \cdot \gamma(x(i, n)). \tag{6b} $$
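
The projection estimation (6a)-(6b) can be sketched as follows for the special case gamma = 1 with the orthonormal Legendre system. Two assumptions are made here and are not taken from the text: the function names are hypothetical, and the Riemann sum is scaled by the grid spacing 2/n (rather than 1/n) so that it approximates the integral over the whole interval [-1, 1].

```python
import math

def legendre_system(k_max, x):
    """phi_1(x), ..., phi_{k_max}(x): orthonormal Legendre polynomials (gamma = 1)."""
    p_prev, p_curr, values = 0.0, 1.0, []
    for j in range(k_max):
        values.append(math.sqrt((2 * j + 1) / 2.0) * p_curr)
        p_prev, p_curr = p_curr, ((2 * j + 1) * x * p_curr - j * p_prev) / (j + 1)
    return values

def coefficient_estimates(x, xi, k_max, gamma=lambda t: 1.0):
    """Estimates c(k, n), k = 1..k_max, as weighted Riemann sums over the design."""
    n = len(x)
    spacing = 2.0 / n                  # assumption: interval length 2 divided by n
    c = [0.0] * k_max
    for t, y in zip(x, xi):
        basis = legendre_system(k_max, t)
        weight = gamma(t)
        for k in range(k_max):
            c[k] += y * basis[k] * weight * spacing
    return c

def projection_estimate(c, x):
    """f(N, n, x) = sum_k c(k, n) * phi_k(x) with N = len(c)."""
    return sum(ck * pk for ck, pk in zip(c, legendre_system(len(c), x)))
```

For noise-free data from f(x) = x, only the second coefficient is essentially non-zero and the projection estimate reproduces f up to the discretisation error.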

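We denote also

$$ A(N, n) = \rho(N) + \frac{N}{n}. $$

Notice that

$$ A(N, n) \asymp \mathbf{E} \|f(N, n, \cdot) - f(\cdot)\|_\gamma^2, \quad N = 1, 2, \ldots, [n/3]; $$

$$ A^*(n) := \min_{N \in [1, n/3]} A(N, n); \qquad N_0 := \mathop{\mathrm{argmin}}_{N \in [1, n/3]} A(N, n). \tag{7} $$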
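
When the true coefficients are known (which is never the case in practice), the non-adaptive cutoff in (7) can be computed directly. The sketch below assumes a finite coefficient list, so the tail sum rho(N) is truncated at the list length; the function names are hypothetical.

```python
def tail_energy(coeffs, N):
    """rho(N) = sum_{k > N} c(k)^2 for a finite list c(1), ..., c(K)."""
    return sum(c * c for c in coeffs[N:])

def oracle_cutoff(coeffs, n):
    """Return (N_0, A*(n)) minimising A(N, n) = rho(N) + N/n over 1 <= N <= n // 3."""
    n_max = max(1, min(len(coeffs), n // 3))
    best = min(range(1, n_max + 1),
               key=lambda N: tail_energy(coeffs, N) + N / n)
    return best, tail_energy(coeffs, best) + best / n
```

The trade-off is visible in the two terms: the tail energy rho(N) decreases with N while the noise term N/n grows.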



We assume in addition that 



lim — = = 0. (7a) 

This condition guarantees that

$$ \lim_{n \to \infty} \frac{\|f(N_0, n) - f(N_0)\|_\gamma^2}{A^*(n)} = 0. \tag{7b} $$

For example, if $\gamma(x) = 1$ and $\rho(N) \asymp N^{-2\Delta}$, $N \to \infty$, then the last condition implies the well-known inequality $\Delta > 1/2$, see, e.g., [6], [8].

An analogous conclusion is true when $\rho(N) \asymp N^{-2\Delta} L(N)$, where $L(N)$ is a slowly varying function as $N \to \infty$.

The function $N \to N^{-2\Delta} L(N)$ is called a regularly varying function as $N \to \infty$. The facts about slowly and regularly varying functions that we use may be found in the classical monograph of E. Seneta [12].

The condition (7a) also holds when $\rho(N) \asymp N^{-2\Delta} L(N)$, $\Delta > 1/2$, and the signal weight function $\gamma(x)$ satisfies the conditions (1.a), (1.b) and (1.c).

Further, we introduce the following important (random) statistics (functionals):

$$ \tau(N, n) = \sum_{k=N+1}^{2N} |c(k, n)|^2; \qquad M = M(n) = \mathop{\mathrm{argmin}}_{N \in [1, n/3]} \tau(N, n), \tag{8} $$

$$ \tau^*(n) = \min_{N \in [1, n/3]} \tau(N, n) = \tau(M(n), n), \tag{9} $$

and we introduce our estimation $f(n, x)$ of the function $f(x)$ as follows:

$$ f(n, x) = f(M(n), n, x) = \sum_{k=1}^{M(n)} c(k, n) \phi_k(x). \tag{10} $$

Note that $M(n)$ is a random sequence which depends only on the source sample $\{\xi(i)\}$, $i = 1, 2, \ldots, n$ (adaptivity).
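
The adaptive choice of the cutoff in (8)-(9) only needs the estimated coefficients. A minimal sketch with hypothetical function names; it additionally restricts N so that all coefficients up to index 2N are available in the supplied list:

```python
def tau(c_hat, N):
    """tau(N, n) = sum_{k = N+1}^{2N} c(k, n)^2; c_hat[k - 1] stores c(k, n)."""
    return sum(c * c for c in c_hat[N:2 * N])

def adaptive_cutoff(c_hat, n):
    """Return (M(n), tau*(n)): the minimiser of tau(N, n) over 1 <= N <= n // 3."""
    n_max = min(n // 3, len(c_hat) // 2)
    if n_max < 1:
        raise ValueError("need n >= 3 and at least two estimated coefficients")
    M = min(range(1, n_max + 1), key=lambda N: tau(c_hat, N))
    return M, tau(c_hat, M)
```

The estimation (10) is then the projection estimate truncated at the returned value M(n).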

2. We suppose also

$$ 0 < \liminf_{N \to \infty} \rho(2N)/\rho(N) \le \limsup_{N \to \infty} \rho(2N)/\rho(N) < 1. \tag{11a} $$

Another case, i.e. when, e.g.,

$$ \rho(2N)/\rho(N) > 0, \qquad \lim_{N \to \infty} \rho(2N)/\rho(N) = 0, \tag{12} $$

may be investigated analogously.

3. Obviously, in the regression problem R considered here we must impose some conditions on the measurement errors $\varepsilon(i)$. We suppose in addition that the centered and normed variables $\{\varepsilon(i)\}$ satisfy the following condition: there exist finite positive constants $q, Q$ such that for all positive values $u$

$$ \max \{ \mathbf{P}(\varepsilon(i) > u),\ \mathbf{P}(\varepsilon(i) < -u) \} \le \exp\left( -(u/Q)^q \right). \tag{13a} $$

In many practical cases it may be assumed that the r.v. $\{\varepsilon(i)\}$ are Gaussian distributed; in this case the inequality (13a) holds with parameters $q = 2$ and $Q = \sqrt{2}$.
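
The tail condition (13a) is easy to check numerically for a standard Gaussian variable. The sketch below is an illustration only: it evaluates the exact Gaussian tail through the complementary error function and compares it with the bound for the constants q = 2, Q = sqrt(2), which correspond to the classical inequality P(eps > u) <= exp(-u^2/2). The function names and the grid are assumptions of this example.

```python
import math

def gaussian_tail(u):
    """P(eps > u) for a standard normal variable eps."""
    return 0.5 * math.erfc(u / math.sqrt(2.0))

def tail_bound(u, q, Q):
    """Right-hand side of (13a): exp(-(u / Q)^q)."""
    return math.exp(-((u / Q) ** q))

def bound_holds(q, Q, grid):
    """True if the Gaussian tail stays below the bound at every grid point."""
    return all(gaussian_tail(u) <= tail_bound(u, q, Q) for u in grid)

grid = [0.01 * j for j in range(1, 801)]           # u in (0, 8]
gaussian_ok = bound_holds(2.0, math.sqrt(2.0), grid)
```

By symmetry of the Gaussian law the same check covers the lower tail P(eps < -u).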



4 Main result 

If we choose $N = N_0$, then we conclude that the optimal speed of convergence for the non-adaptive estimation is asymptotically $A^*(n)$: as $n \to \infty$

$$ \mathbf{E} \|f(N_0, n, \cdot) - f(\cdot)\|_\gamma^2 \asymp A^*(n). \tag{14} $$

The same result is true for our adaptive estimation.

Recall that we suppose the conditions (1.a), (1.b), (1.c), (7a), (11a) and (13a) to be satisfied.

Theorem 1. Let $K$ be an arbitrary regular function space with a norm $\|\cdot\|_K$, compactly embedded in the space $B = B(\Phi, w)$. We denote the minimax value of the loss function by $A^*(n) = A^*(n, K, Z)$, $Z = \mathrm{const} \in (0, \infty)$:

$$ A^*(n, K, Z) := \inf_{g(n, \cdot)} \sup_{f \in K,\ \|f\|_K \le Z} L(g(n, \cdot), f(\cdot)), \tag{15} $$

where the "inf" is calculated over all estimations of $f(\cdot)$, including even non-adaptive ones.

We assert that under the stated conditions our adaptive estimation $f(n, \cdot)$ is, as $n \to \infty$, asymptotically optimal up to a finite multiplicative constant in the minimax sense on arbitrary regular function classes $K$ compactly embedded in the modular space $W = W(\Phi, w)$:

$$ \sup_{f \in K,\ \|f\|_K \le Z} L(f(n, \cdot), f(\cdot)) \le C(\gamma, \Phi, K, Z) \cdot A^*(n, K, Z), \qquad C(\gamma, \Phi, K, Z) < \infty. \tag{16} $$

5. Variance estimation. Note that a consistent estimation $\sigma^2(n)$ of the value $\sigma^2$ may be constructed by the formula [8], [11]

$$ \sigma^2(n) = (n - M(n) - 1)^{-1} \sum_{i=1}^{n} \left( f(n, x(i, n)) - \xi(i) \right)^2. \tag{17} $$
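
The variance estimation can be sketched as a residual sum of squares divided by n - M(n) - 1. This assumes that the fitted values f(n, x(i, n)) have already been computed; the function name is hypothetical.

```python
def residual_variance(fitted, xi, M):
    """sigma^2(n) = (n - M - 1)^(-1) * sum_i (fitted_i - xi_i)^2."""
    n = len(xi)
    if len(fitted) != n:
        raise ValueError("fitted values and observations must have equal length")
    if n - M - 1 <= 0:
        raise ValueError("need n > M + 1")
    rss = sum((a - b) ** 2 for a, b in zip(fitted, xi))
    return rss / (n - M - 1)
```

The divisor accounts for the M(n) coefficients that were estimated from the same sample.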

6. Confidence region. For a rough construction of the confidence domain, also adaptive, in the $L(\Phi, \gamma)$ norm we proved the following result.

Theorem 2. We assert, under the same conditions as in Theorem 1, that there exist positive finite non-random constructive and estimable constants $Y_j = Y_j(\gamma, K, Z)$, $j = 1, 2$, and for the problem (R) $Y_j = Y_j(\gamma, K, Z; q, Q)$, such that

$$ \limsup_{n \to \infty} \tau^*(n)/A^*(n, K, Z) = Y_1 \tag{18} $$

and correspondingly

$$ \limsup_{n \to \infty} \|f(n, \cdot) - f(\cdot)\|_\gamma^2 / \tau^*(n) = Y_2. \tag{19} $$

Therefore, we conclude: with probability tending to one as $n \to \infty$

$$ \|f(n, \cdot) - f(\cdot)\|_\gamma^2 \le 1.05 \cdot Y_2 \cdot \tau^*(n). \tag{20} $$

A more exact computation shows that

$$ \mathbf{E} \|f(n, \cdot) - f(\cdot)\|_\gamma^2 \sim \mathbf{E} \tau^*(n), \tag{21} $$

$$ \mathrm{Var} \|f(n, \cdot) - f(\cdot)\|_\gamma^2 \sim 4 \sigma^2 K^2(\gamma) \rho(M(n))/n + 3 \sigma^4(n) K^4(\gamma) M(n)/n^2 =: \delta^2(n), \tag{22} $$

where

$$ \tau^*(n) \sim \rho(M(n)) + \sigma^2(n) K^2(\gamma) M(n)/n. \tag{23} $$

Therefore, with probability tending to 0.95

$$ \|f(n, \cdot) - f(\cdot)\|_\gamma^2 \le \tau^*(n) + 2.54 \cdot \delta(n). \tag{24} $$



5 Energy estimation 

In order to detect a distortion in the signal (for instance, infarction, heart attack or apoplexy in cardiodiagnostics; leakage in pipelines, in particular in nuclear reactors), we propose to estimate its energy and to compare it with its standard value up to a confidence domain.
1. Ordinary energy.

Let us define for simplicity the so-called ordinary energy functional

$$ G = G(f) = \sum_{k=1}^{\infty} c^2(k) = \|f\|_\gamma^2. \tag{25} $$

k=l 

More exactly, we intend to estimate consistently the truncated energy functional of the form

$$ G_0 = G_0(f) = \sum_{k=1}^{N_0(n)} c^2(k) = \|P_{N_0(n)} f\|_\gamma^2, \tag{26} $$

where $P_{N_0(n)} f$ denotes the projection of the function $f(\cdot)$ onto the subspace generated by the first $N_0(n)$ polynomials $\phi_k(\cdot)$, $k = 1, 2, \ldots, N_0(n)$.

But it is evident that asymptotically, as $n \to \infty$, $G_0 \to G(f)$.

If we take as a first approximation to the functional $G$ the expression

$$ G_0(n) = \sum_{k=1}^{M(n)} c^2(k, n), \tag{27} $$

then we obtain

$$ G_0(n) \sim \sum_{k=1}^{M(n)} \left( c(k) + \sigma K(\gamma) \eta(k)/\sqrt{n} \right)^2, \tag{28} $$

$$ \mathbf{E} G_0(n) \sim \sum_{k=1}^{N_0(n)} c^2(k) + \sigma^2 K^2(\gamma) N_0(n)/n, \tag{29} $$

therefore an asymptotically unbiased estimate of $G = G(f)$ based on $G_0(n)$ has the form

$$ G(n) := \sum_{k=1}^{M(n)} c^2(k, n) - M(n) K^2(\gamma) \sigma^2(n)/n + \tau^*(n). \tag{30} $$
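
The bias-corrected energy estimate (30) combines quantities that are all available from the sample. In the sketch below the constant K^2(gamma) is passed as a parameter `K2` because its value depends on the chosen weight; the function name and argument names are hypothetical.

```python
def energy_estimate(c_hat, M, tau_star, sigma2_hat, K2, n):
    """G(n) = sum_{k <= M} c(k, n)^2 - M * K2 * sigma2_hat / n + tau_star.

    c_hat[k - 1] stores c(k, n); M is the adaptive cutoff M(n);
    tau_star is tau*(n); sigma2_hat is the variance estimate sigma^2(n).
    """
    raw = sum(c * c for c in c_hat[:M])        # naive estimate G_0(n)
    noise_bias = M * K2 * sigma2_hat / n       # expected contribution of the noise
    return raw - noise_bias + tau_star
```

The subtracted term removes the systematic inflation of the squared coefficients by noise, and tau*(n) compensates the truncated tail.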

Theorem 3. As $n \to \infty$ the r.v. $G(n)$ has asymptotically a Gaussian distribution with parameters

$$ G(n) \sim N(G, 4 \sigma^2 K^2(\gamma) G/n). \tag{31} $$
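
Theorem 3 suggests a simple approximate confidence interval for G. The sketch below is a plug-in version and rests on assumptions not stated in the text: the unknown G and sigma^2 in the variance are replaced by their estimates, and z = 1.96 is the standard normal quantile for a two-sided 95% level.

```python
import math

def energy_confidence_interval(G_hat, sigma2_hat, K2, n, z=1.96):
    """Approximate interval G_hat +/- z * sqrt(4 * sigma2_hat * K2 * G_hat / n)."""
    half_width = z * 2.0 * math.sqrt(sigma2_hat * K2 * max(G_hat, 0.0) / n)
    return G_hat - half_width, G_hat + half_width
```

The interval shrinks at the rate n^(-1/2), in agreement with the variance in (31).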
2. Energy of general order. Let us define

$$ W_w(f; n, N) = \sum_{k=1}^{N} w(k) c^2(k, n) - n^{-1} S(N) K^2(\gamma) \sigma^2(n) + \tau_w^*(n). \tag{32} $$

In this section we consider only the case $w(k) = w_\theta(k) \asymp k^{\theta}$, $\theta = \mathrm{const} > 0$, $k \to \infty$; the case when $w(k)$ is regularly varying as $k \to \infty$ may be considered analogously.
We find by direct calculation, under the assumption $f \in W_{w^2}$:

$$ Z(W_w(f; n, N), W_w(f)) \asymp n^{-1} + \rho_w^2(N) + n^{-2} N^{2\theta + 1}. \tag{33} $$

If we choose

$$ N_w = \mathop{\mathrm{argmin}}_N \left( \rho_w^2(f; N) + N^{2\theta + 1}/n^2 \right), $$

we obtain a non-adaptive estimation of the energy $W_w(f)$, optimal in order as $n \to \infty$, of the form

$$ W_w(f; n, N_w). $$
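For example, if $\rho_w(f; N) \asymp N^{-2\Delta}$, $\Delta = \mathrm{const} > 0$, then

$$ N_w = \mathrm{Ent}\left[ n^{1/(2\Delta + \theta + 1/2)} \right], $$

where $\mathrm{Ent}[z]$ denotes the integer part of the number $z$, and

$$ Z(W_w(f; n, N_w), W_w(f)) \asymp \max\left( n^{-1},\ n^{-4\Delta/(2\Delta + \theta + 1/2)} \right). $$

Notice that in the case when $\Delta > 0.5\,\theta + 1/4$

$$ Z(W_w(f; n, N_w), W_w(f)) \asymp n^{-1}, \tag{34} $$

i.e. the estimation is optimal in order as $n \to \infty$.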
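
The cutoff and the resulting rate in this example follow from balancing the two N-dependent terms N^(-4 Delta) and N^(2 theta + 1)/n^2, which gives N of order n^(1/(2 Delta + theta + 1/2)). A small sketch of this arithmetic, with hypothetical function names and with Delta and theta passed as `delta` and `theta`:

```python
import math

def energy_cutoff(n, delta, theta):
    """Integer part of n^(1 / (2*delta + theta + 1/2))."""
    return int(math.floor(n ** (1.0 / (2.0 * delta + theta + 0.5))))

def loss_rate_exponent(delta, theta):
    """Exponent r such that the loss is of order n^(-r):
    r = min(1, 4*delta / (2*delta + theta + 1/2))."""
    return min(1.0, 4.0 * delta / (2.0 * delta + theta + 0.5))
```

For instance, `loss_rate_exponent(0.25, 1.0)` falls below 1, while any delta above theta/2 + 1/4 gives the parametric exponent 1.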

In order to construct an adaptive estimation of the variable $W_w(f)$, we introduce the following statistics:

$$ \tau_w(N) = \tau_w(N, n) = \sum_{k=N+1}^{2N} w(k) c^2(k, n), $$

$$ \hat{N}_w = \mathop{\mathrm{argmin}}_{N \le n/3} \tau_w(N), $$

and introduce the following adaptive estimation (measurement) of the energy:

$$ \hat{W}_w = W_w(f; n, \hat{N}_w). $$

Theorem 4. Suppose $f \in W_{2\theta}$ and

$$ \rho_w(f; N) \asymp N^{-2\Delta} L(N), \quad \Delta = \mathrm{const} > 0, $$

where as before $L = L(N)$ is a slowly varying function as $N \to \infty$. Then

$$ Z(\hat{W}_w, W_w) \le K_4(\Delta, \theta) \cdot \max\left( n^{-1},\ n^{-4\Delta/(2\Delta + \theta + 1)} \right). \tag{35} $$

Notice that if $\Delta > \theta/2 + 1/2$, then

$$ Z(\hat{W}_w, W_w) \le K_5(\Delta, \theta) \cdot n^{-1} \tag{36} $$

(optimality in order, as $n \to \infty$, of the proposed adaptive energy estimation).



6 Sketch of proofs. 



1. Denoising. We have the expansion:

$$ c(k, n) = n^{-1} \sum_{i=1}^{n} \xi(i) \phi_k(x(i, n)) \gamma(x(i, n)) = n^{-1} \sum_{i=1}^{n} \phi_k(x(i, n)) \gamma(x(i, n)) f(x(i, n)) + n^{-1} \sigma \sum_{i=1}^{n} \varepsilon(i) \gamma(x(i, n)) \phi_k(x(i, n)) \sim c(k) + n^{-1/2} \sigma \eta(k). \tag{37} $$

The r.v. $\{\eta(k)\}$ are centered, asymptotically independent and normally distributed, with variance, as $n \to \infty$,

$$ \mathrm{Var}[\eta(k)] = n^{-1} \sum_{i=1}^{n} \phi_k^2(x(i, n)) \gamma^2(x(i, n)) \to \int_{-1}^{1} \phi_k^2(x) \gamma^2(x)\, dx. $$

Further, as $k \to \infty$

$$ \mathrm{Var}[\eta(k)] \to K^2(\gamma). \tag{38} $$

In other words, we get as a first approximation the following expression:

$$ c(k, n) \sim c(k) + K(\gamma)\, \sigma\, n^{-1/2} \zeta(k), $$

where $\{\zeta(k)\}$ are independent standard normally distributed r.v.
We find by direct calculation:

$$ A(N, n) \asymp \rho(N) + n^{-1} N; \tag{39} $$

$$ \mathbf{E}[\tau(N, n)] \asymp \rho(N) + n^{-1} N = A(N, n); \tag{40} $$

$$ \mathrm{Var}[\tau(N, n)] \asymp n^{-2} N + n^{-1} \rho(N). \tag{41} $$

We conclude by virtue of the adaptivity conditions that for some positive constant $C = C(\gamma)$

$$ \mathrm{Var}[\tau(N, n)] \le C\, n^{-1} \cdot A(N, n). $$

Further considerations are similar to those in [1], [10], chapter 5, section 13.
Notice that with probability one

$$ \min_{N \in [1, n]} \tau(N, n) \asymp A^*(n), \quad n \to \infty; \tag{42} $$

see [1], [10], chapter 5, section 13.

2. Ordinary energy estimation. We have

$$ G(n) - G = \frac{2 \sigma K(\gamma)}{\sqrt{n}} \sum_{k=1}^{M(n)} c(k) \eta(k) + \frac{\sigma^2 K^2(\gamma)}{n} \sum_{k=1}^{M(n)} (\eta^2(k) - 1) \sim \frac{2 \sigma K(\gamma)}{\sqrt{n}} \sum_{k=1}^{M(n)} c(k) \eta(k). $$

Therefore

$$ \mathrm{Var}[G(n) - G] \sim 4 \sigma^2 K^2(\gamma) n^{-1} \sum_{k=1}^{N_0(n)} c^2(k) \sim 4 \sigma^2 K^2(\gamma) n^{-1} \sum_{k=1}^{\infty} c^2(k) = 4 \sigma^2 K^2(\gamma) G/n. \tag{43} $$

The asymptotic normality follows from the classical CLT.
3. We prove here the implication (7a) $\to$ (7b).

Denote by $F_n(x)$ the empirical distribution function of the sequence $x(i)$; $F(x) := 0.5 (x + 1)$, $x \in [-1, 1]$; obviously,

$$ \sup_{x \in [-1, 1]} |F_n(x) - F(x)| \le 2/n. $$

Further, put

$$ \sigma_{n,k} = \int_{-1}^{1} [f(x) \phi_k(x) \gamma(x)]\, d(F_n(x) - F(x)), $$

$$ \Sigma(n) = \|f(N_0) - f(N_0, n)\|_\gamma^2; $$

then

$$ \Sigma(n) = \sum_{k=1}^{N_0} \sigma_{n,k}^2. $$

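We can and will assume without loss of generality that the function $f(\cdot)$ has a bounded derivative.

We have for arbitrary values $\epsilon \in (0, 1/2)$:

$$ |\sigma_{n,k}| \le \left| \int_{-1}^{-1+\epsilon} [f(x) \phi_k(x) \gamma(x)]\, d(F_n(x) - F(x)) \right| + \left| \int_{-1+\epsilon}^{1-\epsilon} [f(x) \phi_k(x) \gamma(x)]\, d(F_n(x) - F(x)) \right| + \left| \int_{1-\epsilon}^{1} [f(x) \phi_k(x) \gamma(x)]\, d(F_n(x) - F(x)) \right| =: \sigma_{n,k}^{(1)} + \sigma_{n,k}^{(2)} + \sigma_{n,k}^{(3)}; $$

$$ \sigma_{n,k}^{(1)} \le C_1 \int_{-1}^{-1+\epsilon} \gamma(x)\, dx \le C_2\, \epsilon^{1-\kappa}, $$

and the value $\sigma_{n,k}^{(3)}$ may be estimated analogously.

Further,

$$ \sigma_{n,k}^{(2)} \le C_3 \left[ \sigma_{n,k}^{(4)} + \sigma_{n,k}^{(5)} \right], $$

where

$$ \sigma_{n,k}^{(4)} := \int_{-1+\epsilon}^{1-\epsilon} |F_n(x) - F(x)|\, |\phi_k'(x)|\, \gamma(x)\, dx \le C_4\, k/n; $$

$$ \sigma_{n,k}^{(5)} := \int_{-1+\epsilon}^{1-\epsilon} |F_n(x) - F(x)|\, |\phi_k(x) \gamma'(x)|\, dx \le C_5\, \epsilon^{-\kappa}/n. $$

Therefore,

$$ |\sigma_{n,k}| \le C_6 \left[ \frac{k}{n} + \frac{\epsilon^{-\kappa}}{n} + \epsilon^{1-\kappa} \right]. $$

We conclude, choosing $\epsilon = 0.25/n$:

$$ \sigma_{n,k}^2 \le C_7 \left[ \frac{k^2}{n^2} + \frac{1}{n^{2(1-\kappa)}} \right], $$

and consequently

$$ \lim_{n \to \infty} \frac{\Sigma(n)}{A^*(n)} \le C_7 \lim_{n \to \infty} \frac{1}{A^*(n)} \sum_{k=1}^{N_0} \left[ \frac{k^2}{n^2} + \frac{1}{n^{2(1-\kappa)}} \right] = 0. $$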



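4. The proof of Theorem 4 is completely analogous; see, for example, [4].
Indeed, we have as $n \to \infty$

$$ W_w(n) - W_w \sim -\rho_w(N_w) + \frac{2 \sigma K(\gamma)}{\sqrt{n}} \sum_{k=1}^{N_w(n)} c(k) \eta(k) w(k) + \frac{\sigma^2 K^2(\gamma)}{n} \sum_{k=1}^{N_w(n)} (\eta^2(k) - 1) w(k); $$

therefore

$$ \left[ \mathbf{E}(W_w(n) - W_w) \right]^2 \sim \rho_w^2(N_w), $$

$$ \mathrm{Var}(W_w(n) - W_w) \sim \frac{4 \sigma^2 K^2(\gamma)}{n} \sum_{k=1}^{N_w(n)} c^2(k) w^2(k) + \frac{2 \sigma^4 K^4(\gamma)}{n^2} \sum_{k=1}^{N_w(n)} w^2(k); $$

and we get under the conditions formulated above, in particular $W_{w^2}(f) < \infty$:

$$ Z(W_w(n), W_w) \le C \left[ \rho_w^2(N_w) + n^{-1} + S(N_w; 2)/n^2 \right], $$

etc.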



7 Concluding remarks, applications 



0. The optimality in order as $n \to \infty$ of the signal denoising described above was proved by I. A. Ibragimov; see also the monograph [15], chapter 4.

1. Applying the so-called Fisher transform to the ordinary energy estimation, we conclude that

$$ \sqrt{G(n)} \sim N(\sqrt{G},\ \sigma^2 K^2(\gamma)/n). $$

This fact allows us to build a confidence interval for the value $G$. For example, with probability approximately 0.95

$$ |\sqrt{G} - \sqrt{G(n)}| \le 1.96\, \sigma(n) K(\gamma)/\sqrt{n}. \tag{44} $$

2. Let us return to the problem of diagnosis, for example, cardiodiagnostics. The proposed estimations, together with the corresponding confidence regions, allow us to detect distortions in the observed signal, for instance, in a cardiogram.

Namely, if in the current time interval the energy value does not belong to the confidence region (or fails to belong to this region two or three times), we can conclude that some distortion is present in the signal, in particular an infarction, etc.
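
The decision rule described above can be sketched as a simple monitor over successive time intervals. The sketch assumes that the energy estimate of each interval and a fixed confidence region [lower, upper] for the standard energy value are already available, and it interprets "two or three times" as consecutive exceedances; the function and parameter names are hypothetical.

```python
def distortion_alarms(energies, lower, upper, repeats=2):
    """Indices of time intervals at which the energy estimate has been outside
    the confidence region [lower, upper] for `repeats` consecutive intervals."""
    alarms = []
    run = 0                            # current number of consecutive exceedances
    for idx, g in enumerate(energies):
        if lower <= g <= upper:
            run = 0
        else:
            run += 1
            if run >= repeats:
                alarms.append(idx)
    return alarms
```

Requiring several consecutive exceedances reduces false alarms caused by a single noisy interval, at the price of a delayed detection.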